RAW SEQUENCE LISTING 

PATENT APPLICATION US/08/426,509 



TEAM 7 



DATE: 08/28/95 
TIME: 13:46:12 



INPUT SET: S5863.raw



This Raw Listing contains the General 
Information Section and up to the first 5 pages. 



SEQUENCE LISTING 



General Information: 



ENTERED



(i) APPLICANT: Ullrich, Axel
Gishizsky, Mikhail
Sures, Irman G.



(ii) TITLE OF INVENTION: Novel Megakaryocytic Protein Tyrosine 
Kinases 



(iii) NUMBER OF SEQUENCES: 21 



(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Pennie & Edmonds 

(B) STREET: 1155 Avenue of the Americas 

(C) CITY: New York 

(D) STATE: New York 

(E) COUNTRY: U.S.A. 

(F) ZIP: 10036 

(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/426,509 

(B) FILING DATE: 21-APR-1995 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/232,545 

(B) FILING DATE: 22-APR-1994 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: Coruzzi, Laura A. 

(B) REGISTRATION NUMBER: 30,742 

(C) REFERENCE/DOCKET NUMBER: 7683-074



(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (212) 790-9090

(B) TELEFAX: (212) 869-9741

(C) TELEX: 66141 PENNIE



(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2000 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

CTCGCTCCAA GTTGTGCAGC CGGGACCGCC TCGGGGTGTG CAGCCGGCTC GCGGAGGCCC 60 

TCCTGGGGGC GGGCGCGGGG CGGCTCGGGG GCGCCCCCTG AGCAGAAAAC AGGAAGAACC 120 

AGGCTCGGTC CAGTGGCACC CAGCTCCCTA CCTCCTGTGC CAGCCGCCTG GCCTGTGGCA 180 

GGCCATTCCC AGCGTCCCCG ACTGTGACCA CTTGCTCAGT GTGCCTCTCA CCTGCCTCAG 240 

TTTCCCTCTG GGGGGCGATG GCGGGGCGAG GCTCTCTGGT TTCCTGGCGG GCATTTCACG 300 

GCTGTGATTC TGCTGAGGAA CTTCCCCGGG TGAGCCCCCG CTTCCTCCGA GCCTGGCACC 360 

CCCCTCCCGT CTCAGCCAGG ATGCCAACGA GGCGCTGGGC CCCGGGCACC CAGTGTATCA 420 

CCAAATGCGA GCACACCCGC CCCAAGCCAG GGGAGCTGGC CTTCCGCAAG GGCGACGTGG 480 

TCACCATCCT GGAGGCCTGC GAGAACAAGA GCTGGTACCG CGTCAAGCAC CACACCAGTG 540 

GACAGGAGGG GCTGCTGGCA GCTGGGGCGC TGCGGGAGCG GGAGGCCCTC TCCGCAGACC 600 

CCAAGCTCAG CCTCATGCCG TGGTTCCACG GGAAGATCTC GGGCCAGGAG GCTGTCCAGC 660 

AGCTGCAGCC TCCCGAGGAT GGGCTGTTCC TGGTGCGGGA GTCCGCGCGC CACCCCGGCG 720 

ACTACGTCCT GTGCGTGAGC TTTGGCCGCG ACGTCATCCA CTACCGCGTG CTGCACCGCG 780 

ACGGCCACCT CACAATCGAT GAGGCCGTGT TCTTCTGCAA CCTCATGGAC ATGGTGGAGC 840 

ATTACAGCAA GGACAAGGGC GCTATCTGCA CCAAGCTGGT GAGACCAAAG CGGAAACACG 900 

GGACCAAGTC GGCCGAGGAG GAGCTGGCCA GGGCGGGCTG GTTACTGAAC CTGCAGCATT 960 

TGACATTGGG AGCACAGATC GGAGAGGGAG AGTTTGGAGC TGTCCTGCAG GGTGAGTACC 1020 

TGGGGCAAAA GGTGGCCGTG AAGAATATCA AGTGTGATGT GACAGCCCAG GCCTTCCTGG 1080 




ACGAGACGGC CGTCATGACG AAGATGCAAC ACGAGAACCT GGTGCGTCTC CTGGGCGTGA 1140

TCCTGCACCA GGGGCTGTAC ATTGTCATGG AGCACGTGAG CAAGGGCAAC CTGGTGAACT 1200

TTCTGCGGAC CCGGGGTCGA GCCCTCGTGA ACACCGCTCA GCTCCTGCAG TTTTCTCTGC 1260

ACGTGGCCGA GGGCATGGAG TACCTGGAGA GCAAGAAGCT TGTGCACCGC GACCTGGCCG 1320

CCCGCAACAT CCTGGTCTCA GAGGACCTGG TGGCCAAGGT CAGCGACTTT GGCCTGGCCA 1380

AAGCCGAGCG GAAGGGGCTA GACTCAAGCC GGCTGCCCGT CAAGTGGACG GCGCCCGAGG 1440

CTCTCAAACA CGGGAAGTTC ACCAGCAAGT CGGATGTCTG GAGTTTTGGG GTGCTGCTCT 1500

GGGAGGTCTT CTCATATGGA CGGGCTCCGT ACCCTAAAAT GTCACTGAAA GAGGTGTCGG 1560

AGGCCGTGGA GAAGGGGTAC CGCATGGAAC CCCCCGAGGG CTGTCCAGGC CCCGTGCACG 1620

TCCTCATGAG CAGCTGCTGG GAGGCAGAGC CCGCCCGCCG GCCACCCTTC CGCAAACTGG 1680

CCGAGAAGCT GGCCCGGGAG CTACGCAGTG CAGGTGCCCC AGCCTCCGTC TCAGGGCAGG 1740

ACGCCGACGG CTCCACCTCG CCCCGAAGCC AGGAGCCCTG ACCCCACCCG GTGGGGCCCT 1800

TGGCCCCAGA GGACCGAGAG AGTGGAGAGT GCGGCGTGGG GGCACTGACC AGGCCCAAGG 1860

AGGGTCCAGG CGGGCAAGTC ATCCTCCTGG TGCCCACAGC AGGGGCTGGC CCACGTAGGG 1920

GGCTCTGGGC GGCCCGTGGA CACCCCAGAC CTGCGAAGGA TGATCGCCCG ATAAAGACGG 1980

ATTCTAAGGA CTCTAAAAAA 2000
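
The cumulative count printed at the end of each sequence row can be checked mechanically against the bases actually present. The following is a minimal Python sketch of such a check. It is not part of the original listing or of any USPTO software; the function name `check_running_counts` and its parameters are hypothetical, and it assumes each row consists of whitespace-separated base groups followed by a running total.

```python
import re


def check_running_counts(rows, declared_length, start=0):
    """Compare each row's trailing counter with the number of bases read so far.

    rows            -- iterable of strings such as "ATTCTAAGGA CTCTAAAAAA 2000"
    declared_length -- the value given under "(A) LENGTH" for the sequence
    start           -- bases already consumed before the first row (assumed 0)
    """
    total = start
    for row in rows:
        # Everything except the last token is sequence; the last token is the counter.
        *groups, counter = row.split()
        bases = "".join(groups).upper()
        if not re.fullmatch(r"[ACGTN]+", bases):
            raise ValueError(f"unexpected symbol in row ending at {counter}")
        total += len(bases)
        if total != int(counter):
            raise ValueError(f"row counter {counter} does not match {total} bases read")
    if total != declared_length:
        raise ValueError(f"declared length {declared_length} but {total} bases read")
    return total


# Checking only the final row of SEQ ID NO: 1, given the 1980 bases before it.
print(check_running_counts(["ATTCTAAGGA CTCTAAAAAA 2000"], 2000, start=1980))
```

Passing every row of a sequence with `start` left at 0 would check the whole entry. A mismatch raises `ValueError` at the first row whose counter disagrees, which helps locate a dropped or duplicated base.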

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 507 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Met Ala Gly Arg Gly Ser Leu Val Ser Trp Arg Ala Phe His Gly Cys
1 5 10 15

Asp Ser Ala Glu Glu Leu Pro Arg Val Ser Pro Arg Phe Leu Arg Ala
20 25 30

Trp His Pro Pro Pro Val Ser Ala Arg Met Pro Thr Arg Arg Trp Ala



35 40 45

Pro Gly Thr Gln Cys Ile Thr Lys Cys Glu His Thr Arg Pro Lys Pro
50 55 60

Gly Glu Leu Ala Phe Arg Lys Gly Asp Val Val Thr Ile Leu Glu Ala
65 70 75 80

Cys Glu Asn Lys Ser Trp Tyr Arg Val Lys His His Thr Ser Gly Gln
85 90 95

Glu Gly Leu Leu Ala Ala Gly Ala Leu Arg Glu Arg Glu Ala Leu Ser
100 105 110

Ala Asp Pro Lys Leu Ser Leu Met Pro Trp Phe His Gly Lys Ile Ser
115 120 125

Gly Gln Glu Ala Val Gln Gln Leu Gln Pro Pro Glu Asp Gly Leu Phe
130 135 140

Leu Val Arg Glu Ser Ala Arg His Pro Gly Asp Tyr Val Leu Cys Val
145 150 155 160

Ser Phe Gly Arg Asp Val Ile His Tyr Arg Val Leu His Arg Asp Gly
165 170 175

His Leu Thr Ile Asp Glu Ala Val Phe Phe Cys Asn Leu Met Asp Met
180 185 190

Val Glu His Tyr Ser Lys Asp Lys Gly Ala Ile Cys Thr Lys Leu Val
195 200 205

Arg Pro Lys Arg Lys His Gly Thr Lys Ser Ala Glu Glu Glu Leu Ala
210 215 220

Arg Ala Gly Trp Leu Leu Asn Leu Gln His Leu Thr Leu Gly Ala Gln
225 230 235 240

Ile Gly Glu Gly Glu Phe Gly Ala Val Leu Gln Gly Glu Tyr Leu Gly
245 250 255

Gln Lys Val Ala Val Lys Asn Ile Lys Cys Asp Val Thr Ala Gln Ala
260 265 270

Phe Leu Asp Glu Thr Ala Val Met Thr Lys Met Gln His Glu Asn Leu
275 280 285

Val Arg Leu Leu Gly Val Ile Leu His Gln Gly Leu Tyr Ile Val Met
290 295 300

Glu His Val Ser Lys Gly Asn Leu Val Asn Phe Leu Arg Thr Arg Gly
305 310 315 320

Arg Ala Leu Val Asn Thr Ala Gln Leu Leu Gln Phe Ser Leu His Val
325 330 335 

Ala Glu Gly Met Glu Tyr Leu Glu Ser Lys Lys Leu Val His Arg Asp 
340 345 350 

Leu Ala Ala Arg Asn Ile Leu Val Ser Glu Asp Leu Val Ala Lys Val
355 360 365 

Ser Asp Phe Gly Leu Ala Lys Ala Glu Arg Lys Gly Leu Asp Ser Ser 
370 375 380 

Arg Leu Pro Val Lys Trp Thr Ala Pro Glu Ala Leu Lys His Gly Lys 
385 390 395 400 

Phe Thr Ser Lys Ser Asp Val Trp Ser Phe Gly Val Leu Leu Trp Glu 
405 410 415 

Val Phe Ser Tyr Gly Arg Ala Pro Tyr Pro Lys Met Ser Leu Lys Glu 
420 425 430 

Val Ser Glu Ala Val Glu Lys Gly Tyr Arg Met Glu Pro Pro Glu Gly 
435 440 445 

Cys Pro Gly Pro Val His Val Leu Met Ser Ser Cys Trp Glu Ala Glu 
450 455 460 

Pro Ala Arg Arg Pro Pro Phe Arg Lys Leu Ala Glu Lys Leu Ala Arg 
465 470 475 480 

Glu Leu Arg Ser Ala Gly Ala Pro Ala Ser Val Ser Gly Gln Asp Ala
485 490 495 

Asp Gly Ser Thr Ser Pro Arg Ser Gln Glu Pro
500 505 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2500 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

CCGCTTTTTG CTTAGAGCTT GAGAGTCAAA GTTAAGGACC CACATGTATA CTTCGGCTCT 60 



PAGE:1 SEQUENCE VERIFICATION REPORT DATE: 08/28/95 

PATENT APPLICATION US/08/426,509 TIME: 13:46:26 

INPUT SET: S5863.raw



Line Error Original Text 



4) Application No. 

NOTICE TO COMPLY WITH REQUIREMENTS FOR PATENT APPLICATIONS CONTAINING 
NUCLEOTIDE SEQUENCE AND/OR AMINO ACID SEQUENCE DISCLOSURES 

The nucleotide and/or amino acid sequence disclosure contained in this application does not comply with the requirements
for such a disclosure as set forth in 37 CFR 1.821 - 1.825 for the following reason(s):

□ 1. This application clearly fails to comply with the requirements of 37 CFR 1.821 - 1.825. Applicant's attention is
directed to these regulations, published at 1114 OG 29, May 15, 1990 and at 55 FR 18230, May 1, 1990.

□ 2. This application does not contain, as a separate part of the disclosure on paper copy, a "Sequence Listing" as
required by 37 CFR 1.821(c).

□ 3. A copy of the "Sequence Listing" in computer readable form has not been submitted as required by 37 CFR 1.821(e).

□ 4. A copy of the "Sequence Listing" in computer readable form has been submitted. However, the content of the
computer readable form does not comply with the requirements of 37 CFR 1.822 and/or 1.823, as indicated on the
attached marked-up copy of the "Raw Sequence Listing."

□ 5. The computer readable form that has been filed with this application has been found to be damaged and/or
unreadable as indicated on the attached CRF Diskette Problem Report. A substitute computer readable form must be
submitted as required by 37 CFR 1.825(d).

□ 6. The paper copy of the "Sequence Listing" is not the same as the computer readable form of the "Sequence Listing"
as required by 37 CFR 1.821(e).

□ 7. Other:



Applicant must provide: 

□ An initial or substitute computer readable form (CRF) copy of the "Sequence Listing"

☒ An initial or substitute paper copy of the "Sequence Listing", as well as an amendment directing its entry into the
specification

□ A statement that the content of the paper and computer readable copies are the same and, where applicable, include no
new matter, as required by 37 CFR 1.821(e) or 1.821(f) or 1.821(g) or 1.825(b) or 1.825(d)

For questions regarding compliance with these requirements, please contact:

For Rules Interpretation, call (703) 308-1123
For CRF submission help, call (703) 308-4212
For PatentIn software help, call (703) 308-6856



Please return a copy of this notice with your response. 



